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Abstract 

In this paper we present two original methods for recognizing textual inference. First 
one is a modified resolution method such that some linguistic considerations are in- 
troduced in the unification of two atoms. The approach is possible due to the recent 
methods of transforming texts in logic formulas. Second one is based on semantic rela- 
tions in text, as presented in WordNet. Some similarities between these two methods 
are remarked. 
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1 Introduction 

The recognition of textual inference is one of the most complex task in Natural 
Language Understanding. Thus, a very important problem in some computa- 
tional linguistic applications (as Question Answering, summarization, segmenta- 
tion of discourse, coherence and cohesion of a discourse and others) is to establish 
if a sentence follows from a text. That means in many application it is impor- 
tant to establish if some sentences which are not existing in text are "logical" 
implied (can be inferred) by this text. The importance of text inference in com- 
putational linguistic is proved by the existence of RTE conferences (Recognizing 
Textual Entailment, http:/ / www.pascal-network.org/ Challenges/ RTE/), with 
the forth edition this year, where the main task is to establish the textual en- 
tailment relation. For RTE-1 contest data set includes 1367 English T, H pairs 
(567 for training stage in learning methods and 800 for test). Here the task 
is to determine if the meaning of one text (the entailed hypothesis, H) can be 
inferred from the meaning of the other text (the entailing text, T). 

On the other hand is well known that a linguistic text can be represented 
by a set of logical formulas, called logic forms. Various method were given for 
associating a logical formula with a text: [5], [TT], [13], [5]. From logical point 
of view, if each sentence is represented as a formula, proving a textual infer- 
ence consists in showing that a logical formula is deducible from a set of others 
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formulas. The problem is for First Order Logic a classical (semidecidable) prob- 
lem. In last years, when text mining is very important in many AI applications, 
text inference from both point of view, theorem proving and from linguistics 
perspective, is a very active field of research. In [T^ is presented a system par- 
ticipating in the RTE-1 competition, using some world knowledge axioms and 
a theorem proving tool. 

Let us denote entailment relation between a text T and a sentence or a group 
of sentences H hy T =^ H. 

In this paper we propose two method to solve the problem of establishing 
if T : first is obtained from the classical resolution refutation method, 

completing the unification of two atoms with some linguistic considerations 
(Modified Resolution Method or MRM). Our method differs of [10] by the fact 
that it does not need learning stage and it does not need a graph representation 
and evaluation. The weight (cost) of a deduction is obtained only from the 
weights (costs) of each modified resolution steps. At his turn, the cost of a 
step of resolution is obtained by similarity considerations using some linguistic 
tools as WordNet [4J and Word::Similarity [8j. No background knowledge [Ij is 
needed. 

The second method is based on lexical chains (paths) for entailment spanning 
the text T and the text H ( Lexical-chains Based Method or LBM). A system of 
rules for construction of lexical rules corresponding to entailment is established. 
We claim that MRM and LBM produce similar results. 

In section 2 we will define our modified unification of two atoms method, 
our lexical resolution rule and modified resolution method (MRM). 

In section 3 we will describe LBM method and we will propose another 
definition for text inference based on the cohesion of texts. 

2 Text inference as theorem proving. 

Consider a knowledge base formed by a set of natural language sentences, K . 
Let define a set of inferences rules which is sound, in the sense that it derive 
true new sentences when the initial sentences in K are true. It is a long debate 
about formalisms to represent knowledge such that above desiderata be fulfilled 
[2]. We will use here the method proposed by [IT] of obtaining logical forms 
(in fact, logical formulas) from sentences expressed in natural language. In this 
method each open class word in a sentence (that means noun, verb, adjective, 
adverb) is transformed in a logic predicate (atom). We consider, additionally, 
that the constants are denoted by the names of words they represent. For 
these atoms we propose a new algorithm for unification which modifies the 
classical Robinson unification algorithm by adding some lexical relaxations. The 
semantic information is used in the way we define unification between two atoms, 
as described in the following section. 
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2.1 Lexical unification method for two atoms 

Lexical unification method of two atoms supposes that we have a lexical knowl- 
edge base where the similarity between two words is quantified. A such of lexical 
knowledge base is WordNet [1], a lexical resource which, from his construction 
in 1998 at Princeton University, is largely used in many linguistic application. 
Moreover, some connected resources are constructed (also free) which make use 
of WordNet easier. For example. Word:: similarity is an on-line interface which 
calculates the similarity between two words using some different similarity mea- 
sure, all these starting from WordNet facilities [9] , http: / /www.d.umn.edu~tpederse/similarity.html 
It offers the possibility to calculate similarity between two words, two words 
annotated with POS, or even two words annotated with POS and sense (in 
WordNet notation). Measures used to calculate similarity could be nine, the 
most well known are Path lenght, Leacok and Chodorow, Wu and Palmer and 
Resnik [3]. Of course, a maximal similarity is between words belonging to the 
same synset (concept). 

In the following algorithm we consider that each word of a natural language 
sentence is transformed in atom as in [11]. See our section 2.3. It can be 
seen there that the terms (the arguments) of an atom are always variables or 
constants. The classical unification of atoms is replaced by lexical unification., 
which depends on the similarity in the dictionary WordNet and which relies on 
this remark. Thus, two terms are unifiable if: they are equal, they are words in 
the same Wordnet synset or their similarity as words is bigger than a threshold. 

In the following algorithm we consider that sim{p,p') between two words 
p,p' is that obtained by the Word:: similarity interface. 



INPUT: Two atoms a = p{ti,...,tn) and a' = p' {t[, ...,t'.^), n < m, 
threshold t, threshold for a step r' . The names p and p' are also words in a 
lexical knowledge base. 

OUTPUT: Decision: The atoms are lexical unifiable (with the score W 
being bigger than the threshold r) and the unificator is a, OR they are 

not unifiable for the threshold r. 

Step 1. (7 = empty substitution, W=0. 

Step 2. If p = p' (similarity is maximal, =1) or sim{p,p') > r' 
then W -.— W + sim{p,p') ; go to Step 3 
else STOP: " a and a' are not lexical unifiable" 
Step 3. If (for each ti,i — 1, ...,n exists t'j in {t[, i'm} such that 
ti and t'j are lexical unifiable with the score W , W > t' and the composition of 
all unificators is a' OR for each t'j,j — 1, m exists ti in {^i, i„} such that 
ti and t'j are lexical unifiable with the score W , W > t' and the composition 
of all unificators is a'), and if the new score W (where each W' is added )is 
greater than the threshold r 
then 
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STOP: " a and a' are lexical unifiable and 
a := a composed with cr'" 
else 

STOP: " a and a' are not lexical unifiable" 

Let us observe that the two terms ti and t^- are lexical unifiable in the fol- 
lowing two cases. 

1. First cases are regular cases in FOPC: 

• terms are equal constants; 

• one is a variable, the other is a constant; 

• both are variables. 

In this case the score of lexical unification is 1 . 

2. In the second case, if ti and t'^ are two different constants, as they are 
words in KB, then they are unifiable if sim{ti, t'j) > t' . 

3. Additionally, the similarity sim{p,p') is big when p,p' are from the same 
synset in Wordnet. 

4. As a reviewer pointed up, for lexical unification kill (Oswald, Kennedy) is 
unifiable with kill (Kennedy, Oswald). This is true, however, is hard to obtain 
for the text T the transcription kill (Oswald, Kennedy) and for the hypothesis 
H kill (Kennedy, Oswald), when the same tool for the translation of a text in a 
logical formula is used. 

The similarity between two words is used to calculate a score for unifiability 
of two atoms. The test in this case is that the score is larger than a threshold 
T. The "assumption cost model" presented in [6] uses a similarity measure for 
some dependency graphs matching. The difference with our method is that they 
calculate all unificators and choose the best one (which minimizes a given cost). 
For the modified resolution method, we need to obtain the empty clause once. 
The "cost" of resolution is restricted to be low (the score is high), while the 
condition of step threshold is applied. 

2.2 Lexical resolution rule 

The lexical resolution rule LR, consists in considering of lexical unification of 
two atoms as replacing regular unification: 

Definition 

Two (disjunctive) clauses Ci and Cj provide by lexical resolution the (disjunc- 
tive) clause Cfc with the score W, written as 

^i^C-j \^lexical resolution ,shoTtly, Ci,Cj \=lr C]^ 

\i Ci = I V c'i, Cj = -iZ' V , I and V are lexical unifiable with the score W and 
the unificator cr. The resulting clause is Ck = cr(4) V o'(c^-). 

Remark: by disjunctive clause we mean a disjunction of literals (negated or 
not negated atoms). 
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We will call modified resolution method the repeatedly application of lexical 
resolution rule . So, the modified resolution is the transitive closure of the lexical 
resolution. 

The following definition is a translation of Robinson's definition for classical 
resolution method: 

Definition 

A set of disjunctive clauses C obtained from formulas associated to sentences 
of a text is lexical contradictory for the threshold t if the empty clause [] is 
obtained from the set of formulas C by the modified resolution, and the sum of 
all scores of lexical resolution steps (rules) is bigger than t: 

C D 

As in the case of classical resolution, the modified resolution is a problem of 
search. If we impose in this search problem to choose each time the clauses with 
the biggest score of lexical resolution, we obtain the empty clause (in the case 
of a set of lexical contradictory clauses) with the biggest score of derivation. 

Let us resume the steps of demonstrating by modified resolution method 
that a text T entails the sentence H with the weight r, property denoted by 
T ^MRM.T H : 

• Translate T in a set of logical formulas T' and H in H' (as in the following 
subsection) . 

• Consider the set of formulas T' \Jneg{H'), where by neg{H') we mean the 
logical negation of formula H' 

• Find the set C of disjunctive clauses of the set of formulas T' and neg{H') 

• Verify if the set C is lexical contradictory for the threshold t. In this case 

2.3 Logical form derivation from sentences 

We will use the method established by jll] which is applied to texts which are 
part of speech tagged and syntactic analyzed. 
The method is the following: 

• A predicate is generated for every noun, verb, adjective and adverb (pos- 
sibly even for prepositions and conjunctions). The name of a predicate is 
obtained from the morpheme of word; 

• If the word is a noun, then the corresponding predicate will have as argu- 
ment a variable, as individual object. Example: person(x2). 
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• If the word is a verb, then the corresponding predicate will have as first 
argument an argument for the event (or action denoted by the verb). 
Moreover, if the verb is intransitive it will have as arguments two variables: 
one for the event and one for the subject argument. If the verb is transitive 
it will have as arguments three variables: one for the event, one for the 
subject and one for the direct complement. If the verb is ditransitive it 
will have as arguments four variables: two for the event and the subject 
and two for the direct complement and the indirect complement. 

• The arguments of verb predicates are always in the order: event, subject, 
direct object, indirect object, (the condition is not necessary for modified 
unification) 

• If the word is an adjective (adverb) it will introduce a predicate with the 
same argument as the predicate introduced for modified noun (verb). 

Example: man-made o6jec< is translated as: object(xl) AND man-made(xl) 

• If the word is a preposition or a conjunction it will introduce a predicate 
with the same argument as the modified word. 

Some transformation rules that create predicates and assign them arguments 
are presented in [TT]. These are obtained from the set of rules of the syntac- 
tic analyzer. For example, the rule for the introduction of noun predicate is 
ART NOUN — > noun{xi). The rule for introduction of adverb predicate is: 
VERB ADVERB — > verb{ei,xi,X2) AND adverb{ei). 

Let us consider the following example from [T2] : 

T: John and his son, George, emigrated with Mike, John's uncle, to US in 
1969 

H: George and his relative, Mike, came to America 
The logical form obtained for T is: 

John(xi) A son{x2) A George{x2) A emigrated{ei) A Agent{xi, ei) 

f\Agent{x2, ei) A Mike(xz) A uncle{xi, X3) A Location(ei, xa) 
AUS{xi) A Time{ei,X5) A 1969(x5) 
The logical form obtained for H is: 

George{xi) A relative{x2) A Mike{x2) A came(ei) A Agent{xi, ei) 

AAgent(x2, ei) A America^x^) A Location{ei, X3) 

Applying the unification lexical method for two atoms and lexical resolution 
rule for the obtained disjunctive clauses, we obtain empty clause, as follows. 
First, the set of clauses for neg{H)) is formed by only one disjunctive clause: 
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-iGeorge{xi) V -^relative{x2) V -iMike{x2) V -icame(ei) V -^Agent{xi, ei) 

\/-'Agent(x2, ei) V -^America{xj,) V -iLocation^ei, X2) 

Then, if we apply modified unification between the following pairs of atoms, 
the empty clause is obtained: 

relative{x2), uncle{xi, X3) 
America{x3), U S{x4), 
emigrated{ei) , came(ei). 

The similarities for the pair relative, uncle, for the pair America, US and for 
the pair emigrated, came are calculated with Word::similarity. So T =^mb.ai,t 
H where the threshold r must be be smaller then the sum of these similarities. 

Let us remark that in |12j the result is obtained using additionally 6 axioms. 

3 Entailment on linguistic bases 

In this section we will introduce another definition for entailment between a text 
T and a sentence H . This definition is based on the concept of lexical paths 
and on the semantical relations presented on WordNet. 

In the huge knowledge base which is WordNet there are many semantic 
relations which are defined between synsets of nouns, verbs, adverbs and of 
adjectives. Synsets in WordNet (or concepts) are set of words which are: 

a) with the same POS and 

b) are similar as meaning (or synonyms). 

The most well known semantical relation is the relation IS- A between synsets 
of nouns (or of verbs). The relations ENTAIL and CAUSE-TO defined only 
between synsets of verbs, are the most suited for purposes of entailment study. 

We will define a lexical path for entailment between two words wi and W2 , 
denoted by LPE{wi, W2), a. path of the form: 

LPE{wi,W2) =CiriC2r2 r^^iCfc 

where wi is from the synset ci, W2 is from the synset Ck and each relation rj 
is a semantical WordNet relation of the form IS- A or ENTAIL or CAUSE-TO 
between synsets Cj and Cj+i. A lexical path for entailment, LPE{wi,W2), can 
be described as a regular expression of the form: 

ciriC2r2 Tk-iCk e ((< concept > {IS-A))*{< concept > (ENTAIL))* 

((< concept > {IS - A))*{< concept > (CAUSE - TO)*)* < concept > 

The relations IS- A, ENTAIL and CAUSE-TO are transitive and no simetric. 
Thus the paths LPE(wi,W2) and all the concepts defined using they have an 
orientation from wi to 'W2- 
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Definition 

T ^LPE,T H if card({LP-E('u;i, m;2) \ wx & T, & H}) is greater tlian a 
given threshold r. 

A method to construct a path LPE{w\, W2) is to apply the following rules: 

• Prom ci IS — A ci and c^IS — A C3 it results c\ IS — A C3 

• Prom ciIS-A C2 and C2 ENTAIL C3 it results ci ENTAIL cs 

• Prom ci ENTAIL C2 and C2 /5 - A C3 it results ci ENTAIL C3 

• From ci ENTAIL C2 and C2 ENTAIL C3 it results ci ENTAIL C3 

• From ci IS"- A ca and ca CAUSE -TO C3 it results ci CAUSE -TO C3 

• From ci CAUSE -TO ca and ca /S*- A C3 it results a CAUSE -TO C3 

• From ci CAUSE -TO C2 and C2 CylZ/S'i;-™ C3 it results ci C^f/S'i;- 
TO C3 

• From ci CAUSE - TO C2 and ca ENTAIL C3 it results ci ENTAIL C3 

• From ci ENTAIL ca and C2 CAC/5i; - TO C3 it results ci ENTAIL C3 
We claim that the following theorem holds: 

Theorem 

For each given threshold r there exists a threshold r' such that the relation 
T =^LPE,T H holds iff T =^mrm,t' H holds. 



4 Conclusions and further work 

In this paper we presented two methods for recognizing textual inference: one 
is from the logic resolution area, using a modified unification algorithm, the sec- 
ond is a pure semantic lexical method and uses the big facilities offered by the 
huge semantical dictionary WordNet. We consider that the meaning of these 
methods has common roots: the similarity between two atoms in Tinification 
algorithm and the lexical path for entailment are calculated considering seman- 
tical relations which exist between concepts (synsets) in WordNet. A study of 
the relation between t, t' is in our attention. 

The combined methods in Artificial Intelligence between approaches so dif- 
ferent, as Logic and Linguistics, are very largely developed in the last time. The 
present paper belongs to this category of combined methods. 
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